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5 s o 
m - rff 52 



ENTERED 



3 <110> APPLICANT: DUNICAN, L.K. 

4 MCCORMACK , ASHLING 

5 STAPELTON/ CLIONA 

6 BURKE, KEVIN 

7 MOCKEL, BETTINA 

9 <120> TITLE OF INVENTION: NEW NUCLEOTIDE SEQUENCES WHICH CODE FOR THE TAL GENE 
11 <130> FILE REFERENCE: MAS/21123/258100 

13 <140> CURRENT APPLICATION NUMBER: 09/531,266 

14 <141> CURRENT FILING DATE: 2000-03-20 

16 <150> PRIOR APPLICATION NUMBER: 60/142,915 

17 <151> PRIOR FILING DATE: 1999-07-09 
19 <160> NUMBER OF SEQ ID NOS : 6 

21 <170> SOFTWARE: Patentln Ver. 2.1 
23 <210> SEQ ID NO: 1 
.24 <211> LENGTH: 6995 

25 <212> TYPE: DNA 

26 <213> ORGANISM: Corynebacterium glutamicum 

28 <220> FEATURE: 

29 <221> NAME/KEY: CDS 

30 <222> LOCATION: ( 2471) ..( 3550 ) 

31 <223> OTHER INFORMATION: tal-Gen 

33 <400> SEQUENCE: 1 

34 cacatttgaa ccacagttgg ttataaaatg ggttcaacat cactatggtt agaggtgttg 60 

3 6 acgggtcaga ttaagcaaag actactttcg gggtagatca cctttgccaa atttgaacca 120 
38 attaacctaa gtcgtagatc tgatcatcgg atctaacgaa aacgaaccaa aactttggtc 180 
40 ccggtttaac ccaggaagga ttgaccacct tgacgctgtc acctgaactt caggcgctca 240 
42 ctgtacgcaa ttacccctct gattggtccg atgtggacac caaggctgta gacactgttc 300 
44 gtgtcctcgc tgcagacgct gtagaaaact gtggctccgg ccacccaggc accgcaatga 360 
46 gcctggctcc ccttgcatac accttgtacc agcgggttat gaacgtagat ccacaggaca 420 

4 8 ccaactgggc aggccgtgac cgcttcgttc tttcttgtgg ccactcctct ttgacccagt 4 80 
50 acatccagct ttacttgggt ggattcggcc ttgagatgga tgacctgaag gctctgcgca 540 
52 cctgggattc cttgacccca ggacaccctg agtaccgcca caccaagggc gttgagatca 600 
54 ccactggccc tcttggccag ggtcttgcat ctgcagttgg tatggccatg gctgctcgtc 660 
56 gtgagcgtgg cctattcgac ccaaccgctg ctgagggcga atccccattc gaccaccaca 720 
58 tctacgtcat tgcttctgat ggtgacctgc aggaaggtgt cacctctgag gcatcctcca 780 
60 tcgctggcac ccagcagctg ggcaacctca tcgtgttctg ggatgacaac cgcatctcca 840 
62 tcgaagacaa cactgagatc gctttcaacg aggacgttgt tgctcgttac aaggcttacg 900 
64 gctggcagac cattgaggtt gaggctggcg aggacgttgc agcaatcgaa gctgcagtgg 960 
66 ctgaggctaa gaaggacacc aagcgaccta ccttcatccg cgttcgcacc atcatcggct 1020 
68 tcccagctcc aactatgatg aacaccggtg ctgtgcacgg tgctgctctt ggcgcagctg 1080 
70 aggttgcagc aaccaagact gagcttggat tcgatcctga ggctcacttc gcgatcgacg 1140 
72 atgaggttat cgctcacacc cgctccctcg cagagcgcgc tgcacagaag aaggctgcat 1200 
74 ggcaggtcaa gttcgatgag tgggcagctg ccaaccctga gaacaaggct ctgttcgatc 1260 
76 gcctgaactc ccgtgagctt ccagcgggct acgctgacga gctcccaaca tgggatgcag 1320 
78 atgagaaggg cgtcgcaact cgtaaggctt ccgaggctgc acttcaggca ctgggcaaga 1380 
80 cccttcctga gctgtggggc ggttccgctg acctcgcagg ttccaacaac accgtgatca 1440 
82 agggctcccc ttccttcggc cctgagtcca tctccaccga gacctggtct gctgagcctt 1500 
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84 acggccgtaa cctgcacttc ggtatccgtg agcacgctat gggatccatc ctcaacggca 1560 
86 tttccctcca cggtggcacc cgcccatacg gcggaacctt cctcatcttc tccgactaca 1620 
88 tgcgtcctgc agttcgtctt gcagctctca tggagaccga cgcttactac gtctggaccc 1680 
90 acgactccat cggtctgggc gaagatggcc caacccacca gcctgttgaa accttggctg 1740 
92 cactgcgcgc catcccaggt ctgtccgtcc tgcgtcctgc agatgcgaac gagaccgccc 1800 
94 aggcttgggc tgcagcactt gagtacaagg aaggccctaa gggtcttgca ctgacccgcc 1860 
96 agaacgttcc tgttctggaa ggcaccaagg agaaggctgc tgaaggcgtt cgccgcggtg 1920 
98 gctacgtcct ggttgagggt tccaaggaaa ccccagatgt gatcctcatg ggctccggct 1980 
100 ccgaggttca gcttgcagtt aacgctgcga aggctctgga agctgagggc gttgcagctc 2040 
102 gcgttgtttc cgttccttgc atggattggt tccaggagca ggacgcagag tacatcgagt 2100 
104 ccgttctgcc tgcagctgtg accgctcgtg tgtctgttga agctggcatc gcaatgcctt 2160 
106 ggtaccgctt cttgggcacc cagggccgtg ctgtctccct tgagcacttc ggtgcttctg 2220 
108 cggattacca gaccctgttt gagaagttcg gcatcaccac cgatgcagtc gtggcagcgg 2280 
110 ccaaggactc cattaacggt taattgccct gctgttttta gcttcaaccc ggggcaatat 2340 
112 gattctccgg aattttattg ccccgggttg ttgttgttaa tcggtacaaa gggtcttaag 2400 
114 cacatccctt acttgcctgc tctccttgag cacagttcaa gaacaattct tttaaggaaa 2460 

116 atttagtttc atg tct cac att gat gat ctt gca cag etc ggc act tec 2509 

117 Met Ser His lie Asp Asp Leu Ala Gin Leu Gly Thr Ser 



118 








1 








5 










10 










120 


act 


tgg 


etc 


gac 


gac 


etc 


tec 


cgc 


gag 


cgc 


att 


act 


tec 


ggc 


aat 


etc 


2557 


121 


Thr 


Trp 


Leu 


Asp 


Asp 


Leu 


Ser 


Arg 


Glu 


Arg 


He 


Thr 


Ser 


Gly 


Asn 


Leu 




122 




15 










20 










25 












124 


age 


cag 


gtt 


att 


gag 


gaa 


aag 


tct 


gta 


gtc 


ggt 


gtc 


acc 


acc 


aac 


cca 


2605 


125 


Ser 


Gin 


Val 


He 


Glu 


Glu 


Lys 


Ser 


Val 


Val 


Gly 


Val 


Thr 


Thr 


Asn 


Pro 




126 


30 










35 










40 










45 




128 


get 


att 


ttc 


gca 


gca 


gca 


atg 


tec 


aag 


ggc 


gat 


tec 


tac 


gac 


get 


cag 


2653 


129 


Ala 


He 


Phe 


Ala 


Ala 


Ala 


Met 


Ser 


Lys 


Gly 


Asp 


Ser 


Tyr 


Asp 


Ala 


Gin 




130 










50 










55 










60 






132 


ate 


gca 


gag 


etc 


aag 


gee 


get 


ggc 


gca 


tct 


gtt 


gac 


cag 


get 


gtt 


tac 


2701 


133 


He 


Ala 


Glu 


Leu 


Lys 


Ala 


Ala 


Gly 


Ala 


Ser 


Val 


Asp 


Gin 


Ala 


Val 


Tyr 




134 








65 










70 










75 








136 


gee 


atg 


age 


ate 


gac 


gac 


gtt 


cgc 


aat 


get 


tgt 


gat 


ctg 


ttc 


acc 


ggc 


2749 


137 


Ala 


Met 


Ser 


He 


Asp 


Asp 


Val 


Arg 


Asn 


Ala 


Cys 


Asp 


Leu 


Phe 


Thr 


Gly 




138 






80 










85 










90 










140 


ate 


ttc 


gag 


tec 


tec 


aac 


ggc 


tac 


gac 


ggc 


cgc 


gtg 


tec 


ate 


gag 


gtt 


2797 


141 


He 


Phe 


Glu 


Ser 


Ser 


Asn 


Gly 


Tyr 


Asp 


Gly 


Arg 


Val 


Ser 


He 


Glu 


Val 




142 




95 










100 










105 












144 


gac 


cca 


cgt 


ate 


tct 


get 


gac 


cgc 


gac 


gca 


acc 


ctg 


get 


cag 


gec 


aag 


2845 


145 


Asp 


Pro 


Arg 


He 


Ser 


Ala 


Asp 


Arg 


Asp 


Ala 


Thr 


Leu 


Ala 


Gin 


Ala 


Lys 




146 


110 










115 










120 










125 




148 


gag 


ctg 


tgg 


gca 


aag 


gtt 


gat 


cgt 


cca 


aac 


gtc 


atg 


ate 


aag 


ate 


cct 


2893 


149 


Glu 


Leu 


Trp 


Ala 


Lys 


Val 


Asp 


Arg 


Pro 


Asn 


Val 


Met 


He 


Lys 


He 


Pro 




150 










130 










135 










140 






152 


gca 


acc 


cca 


ggt 


tct 


ttg 


cca 


gca 


ate 


acc 


gac 


get 


ttg 


get 


gag 


ggc 


2941 


153 


Ala 


Thr 


Pro 


Gly 


Ser 


Leu 


Pro 


Ala 


He 


Thr 


Asp 


Ala 


Leu 


Ala 


Glu 


Gly 




154 








145 










150 










155 








156 


ate 


age 


gtt 


aac 


gtc 


acc 


ttg 


ate 


ttc 


tec 


gtt 


get 


cgc 


tac 


cgc 


gag 


2989 


157 


He 


Ser 


Val 


Asn 


Val 


Thr 


Leu 


He 


Phe 


Ser 


Val 


Ala 


Arg 


Tyr 


Arg 


Glu 




158 






160 










165 










170 
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160 


gtc 


ate 


get 


gcg 


ttc 


ate 


gag 


ggc 


ate 


aag 


cag 


get 


get 


gca 


aac 


ggc 


3037 


161 


Val 


He 


Ala 


Ala 


Phe 


He 


Glu 


Gly 


He 


Lys 


Gin 


Ala 


Ala 


Ala 


Asn 


Gly 




162 




175 










180 










185 












164 


cac 


gac 


gtc 


tec 


aag 


ate 


cac 


tct 


gtg 


get 


tec 


ttc 


ttc 


gtc 


tec 


cgc 


3085 


165 


His 


Asp 


Val 


Ser 


Lys 


He 


His 


Ser 


Val 


Ala 


Ser 


Phe 


Phe 


Val 


Ser 


Arg 




166 


190 










195 










200 










205 




168 


gtc 


gac 


gtt 


gag 


ate 


gac 


aag 


cgc 


etc 


gag 


gca 


ate 


gga 


tec 


gat 


gag 


3133 


169 


Val 


Asp 


Val 


Glu 


He 


Asp 


Lys 


Arg 


Leu 


Glu 


Ala 


He 


Gly 


Ser 


Asp 


Glu 




170 










210 










215 










220 






172 


get 


ttg 


get 


ctg 


cgc 


ggc 


aag 


gca 


ggc 


gtt 


gee 


aac 


get 


cag 


cgc 


get 


3181 


173 


Ala 


Leu 


Ala 


Leu 


Arg 


Gly 


Lys 


Ala 


Gly 


Val 


Ala 


Asn 


Ala 


Gin 


Arg 


Ala 




174 








225 










230 










235 








176 


tac 


get 


gtg 


tac 


aag 


gag 


ctt 


ttc 


gac 


gee 


gee 


gag 


ctg 


cct 


gaa 


ggt 


3229 


177 


Tyr 


Ala 


Val 


Tyr 


Lys 


Glu 


Leu 


Phe 


Asp 


Ala 


Ala 


Glu 


Leu 


Pro 


Glu 


Gly 




178 






240 










245 










250 










180 


gec 


aac 


act 


cag 


cgc 


cca 


ctg 


tgg 


gca 


tec 


acc 


ggc 


gtg 


aag 


aac 


cct 


3277 


181 


Ala 


Asn 


Thr 


Gin 


Arg 


Pro 


Leu 


Trp 


Ala 


Ser 


Thr 


Gly 


Val 


Lys 


Asn 


Pro 




182 




255 










260 










265 












184 


gcg 


tac 


get 


gca 


act 


ctt 


tac 


gtt 


tec 


gag 


ctg 


get 


ggt 


cca 


aac 


acc 


3325 


185 


Ala 


Tyr 


Ala 


Ala 


Thr 


Leu 


Tyr 


Val 


Ser 


Glu 


Leu 


Ala 


Gly 


Pro 


Asn 


Thr 




186 


270 










275 










280 










285 




188 


gtc 


aac 


acc 


atg 


cca 


gaa 


ggc 


acc 


ate 


gac 


gcg 


gtt 


ctg 


gag 


cag 


ggc 


3373 


189 


Val 


Asn 


Thr 


Met 


Pro 


Glu 


Gly 


Thr 


He 


Asp 


Ala 


Val 


Leu 


Glu 


Gin 


Gly 




190 










290 










295 










300 






192 


aac 


ctg 


cac 


ggt 


gac 


acc 


ctg 


tec 


aac 


tec 


gcg 


gca 


gaa 


get 


gac 


get 


3421 


193 


Asn 


Leu 


His 


Gly 


Asp 


Thr 


Leu 


Ser 


Asn 


Ser 


Ala 


Ala 


Glu 


Ala 


Asp 


Ala 




194 








305 










310 










315 








196 


gtg 


ttc 


tec 


cag 


ctt 


gag 


get 


ctg 


ggc 


gtt 


gac 


ttg 


gca 


gat 


gtc 


ttc 


3469 


197 


Val 


Phe 


Ser 


Gin 


Leu 


Glu 


Ala 


Leu 


Gly 


Val 


Asp 


Leu 


Ala 


Asp 


Val 


Phe 




198 






320 










325 










330 










200 


cag 


gtc 


ctg 


gag 


acc 


gag 


ggt 


gtg 


gac 


aag 


ttc 


gtt 


get 


tct 


tgg 


age 


3517 


201 


Gin 


Val 


Leu 


Glu 


Thr 


Glu 


Gly 


Val 


Asp 


Lys 


Phe 


Val 


Ala 


Ser 


Trp 


Ser 




202 




335 










340 










345 












204 


gaa 


ctg 


ctt 


gag 


tec 


atg 


gaa 


get 


cgc 


ctg 


aag 


tagaatcagc acgctgcatc 


3570 


205 


Glu 


Leu 


Leu 


Glu 


Ser 


Met 


Glu 


Ala 


Arg 


Leu 


Lys 














206 


350 










355 










360 














208 


agtaacggcg < 


acatgaaatc gaattagttc 


gatcttatgt 


ggccgttaca catctttcat 


3630 


210 


taaagaaagg < 


ategtgacac taccategtg 


agcacaaaca 


cgaccccctc cagctggaca 


3690 


212 


aacccactgc < 


gcgacccgca ggataaacga 


ctcccccgca 


tcgctggccc ttccggcatg 


3750 


214 


gtgatcttcg < 


gtgtcactgg cgacttggct 


cgaaagaagc 


tgctccccgc catttatgat 


3810 


216 


ctagcaaacc < 


geggattget gcccccagga 


ttctcgttgg 


taggttaegg ccgccgcgaa 


3870 


218 


tggtccaaag ; 


aagactttga aaaatacgta 


cgcgatgccg 


caagtgctgg tgctcgtacg 


3930 


220 


gaattccgtg < 


aaaatgtttg ggagcgcctc 


gecgagggta 


tggaatttgt tcgcggcaac 


3990 


222 


tttgatgatg i 


atgcagcttt cgacaacctc 


gctgcaacac 


teaagegcat cgacaaaacc 


4050 


224 


cgcggcaccg < 


seggcaactg ggcttactac 


ctgtccattc 


caccagattc cttcacagcg 


4110 


226 


gtctgccacc < 


agctggagcg ttccggcatg 


gctgaatcca 


ccgaagaagc atggcgccgc 


4170 


228 


gtgatcatcg i 


agaagecttt cggccacaac 


ctcgaatccg 


cacacgagct caaccagctg 


4230 


230 


gtcaacgcag * 


tcttcccaga atcttctgtg 


ttccgcatcg 


accactattt gggcaaggaa 


4290 


232 


acagttcaaa < 


acatcctggc tctgcgtttt 


gctaaccagc 


tgtttgagcc actgtggaac 


4350 
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234 tccaactacg ttgaccacgt ccagatcacc atggctgaag atattggctt gggtggacgt 4410 

236 gctggttact acgacggcat cggcgcagcc cgcgacgtca tccagaacca cctgatccag 4470 

238 ctcttggctc tggttgccat ggaagaacca atttctttcg tgccagcgca gctgcaggca 4530 

240 gaaaagatca aggtgctctc tgcgacaaag ccgtgctacc cattggataa aacctccgct 4590 

242 cgtggtcagt acgctgccgg ttggcagggc tctgagttag tcaagggact tcgcgaagaa 4650 

244 gatggcttca accctgagtc caccactgag acttttgcgg cttgtacctt agagatcacg 4710 

246 tctcgtcgct gggctggtgt gccgttctac ctgcgcaccg gtaagcgtct tggtcgccgt 4770 

248 gttactgaga ttgccgtggt gtttaaagac gcaccacacc agcctttcga cggcgacatg 4830 

250 actgtatccc ttggccaaaa cgccatcgtg attcgcgtgc agcctgatga aggtgtgctc 4 890 

2 52 atccgcttcg gttccaaggt tccaggttct gccatggaag tccgtgacgt caacatggac 4950 

2 54 ttctcctact cagaatcctt cactgaagaa tcacctgaag catacgagcg cctcattttg 5010 

256 gatgcgctgt tagatgaatc cagcctcttc cctaccaacg aggaagtgga actgagctgg 5070 

258 aagattctgg atccaattct tgaagcatgg gatgccgatg gagaaccaga ggattaccca 5130 

260 gcgggtacgt ggggtccaaa gagcgctgat gaaatgcttt cccgcaacgg tcacacctgg 5190 

2 62 cgcaggccat aatttagggg caaaaaatga tctttgaact tccggatacc accacccagc 5250 

264 aaatttccaa gaccctaact cgactgcgtg aatcgggcac ccaggtcacc accggccgag 5310. 

266 tgctcaccct catcgtggtc actgactccg aaagcgatgt cgctgcagtt accgagtcca 5370 

268 ccaatgaagc ctcgcgcgag cacccatctc gcgtgatcat tttggtggtt ggcgataaaa 5430 

270 ctgcagaaaa caaagttgac gcagaagtcc gtatcggtgg cgacgctggt gcttccgaga 5490 

272 tgatcatcat gcatctcaac ggacctgtcg ctgacaagct ccagtatgtc gtcacaccac 5550 

274 tgttgcttcc tgacaccccc atcgttgctt ggtggccagg tgaatcacca aagaatcctt 5610 

276 cccaggaccc aattggacgc atcgcacaac gacgcatcac tgatgctttg tacgaccgtg 5670 

278 atgacgcact agaagatcgt gttgagaact atcacccagg tgataccgac atgacgtggg 5730 

280 cgcgccttac ccagtggcgg ggacttgttg cctcctcatt ggatcaccca ccacacagcg 5790 

282 aaatcacttc cgtgaggctg accggtgcaa gcggcagtac ctcggtggat ttggctgcag 5850 

284 gctggttggc gcggaggctg aaagtgcctg tgatccgcga ggtgacagat gctcccaccg 5910 

286 tgccaaccga tgagtttggt actccactgc tggctatcca gcgcctggag atcgttcgca 5970 

288 ccaccggctc gatcatcatc accatctatg acgctcatac ccttcaggta gagatgccgg 6030 

290 aatccggcaa tgccccatcg ctggtggcta ttggtcgtcg aagtgagtcc gactgcttgt 6090 

292 ctgaggagct tcgccacatg gatccagatt tgggctacca gcacgcacta tccggcttgt 6150 

294 ccagcgtcaa gctggaaacc gtctaaggag aaatacaaca ctatggttga tgtagtacgc 6210 

296 gcacgcgata ctgaagattt ggttgcacag gctgcctcca aattcattga ggttgttgaa 6270 

298 gcagcaactg ccaataatgg caccgcacag gtagtgctca ccggtggtgg cgccggcatc 6330 

300 aagttgctgg aaaagctcag cgttgatgcg gctgaccttg cctgggatcg cattcatgtg 6390 

302 ttcttcggcg atgagcgcaa tgtccctgtc agtgattctg agtccaatga gggccaggct 6450 

304 cgtgaggcac tgttgtccaa ggtttctatc cctgaagcca acattcacgg atatggtctc 6510 

306 ggcgacgtag atcttgcaga ggcagcccgc gcttacgaag ctgtgttgga tgaattcgca 6570 

308 ccaaacggct ttgatcttca cctgctcggc atgggtggcg aaggccatat caactccctg 6630 

310 ttccctcaca ccgatgcagt caaggaatcc tccgcaaagg tcatcgcggt gtttgattcc 6690 

312 cctaagcctc cttcagagcg tgcaactcta acccttcctg cggttcactc cgcaaagcgc 6750 

314 gtgtggttgc tggtttctgg tgcggagaag gctgaggcag ctgcggcgat cgtcaacggt 6810 

316 gagcctgctg ttgagtggcc tgctgctgga gctaccggat ctgaggaaac ggtattgttc 6870 

318 ttggctgatg atgctgcagg aaatctctaa gcagcgccag ctctaacaag aagctttaac 6930 

320 aagaagctct aacgaaaagc actaacaaac taatccgggt gcgaaccttc atctgaatcg 6990 

322 atgga 6995 

325 <210> SEQ ID NO: 2 

326 <211> LENGTH: 360 

327 <212> TYPE: PRT 

328 <213> ORGANISM: Corynebacterium glutamicum 
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330 


<400> SEQUENCE: 


2 
























331 


Met 


Ser 


His 


He 


Asp 


Asp 


Leu 


Ala 


Gin 


Leu 


Gly 


Thr 


Ser 


Thr 


Trp 


Leu 


332 


1 








5 










10 










15 
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